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In response to the Office Action mailed on February 25, 2004 and the 
advisory action of June 12, 2004, please amend the above-identified Application 
as follows: 



IN THE CLAIMS 

1 . (Currently Amended) In a server, a method for providing information 
suitable for audio output, the method comprising: 

receiving a first set of information over a network based on a 
request for the first set of information , receiving the first set of information 
further comprising: 

receiving speech information specifying the first set of 
information: 

generating a text reguest for the first set of information 
based on an acoustic speech recognition (ASR) technigue applied 
to the speech information, generating including interpreting at least 
one primitive construct based on the speech information and 
generating at least one additional primitive construct based on a 
request for a user-defined command, and 

submitting the text reouest over the network : 
accessing a tagged document in response to receiving the first set 
of information, accessing the tagged document further including: 

determining an identity of the request for the first set of information; 

and 

accessing the tagged document based on the identity of the 
request, wherein the identity of the request is based on at least one of an 
identifier for an originator of the request and an identifier for a destination 
of the request; and 

generating a second set of information suitable for audio output 
based on the first set of information and the tagged document. 
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2. (Original) The method of claim 1 , wherein: 

the step of receiving the first set of information comprises receiving 
a web page based on a Uniform Resource Locator (URL) request for the 
web page; 

the step of accessing the tagged document comprises accessing 
an Extensible Markup Language (XML) document; and 

the step of generating the second set of information comprises 
generating filtered web content suitable for audio output based on the 
web page and the XML document. 

3. (Cancelled). 

4. (Cancelled) 

5. (Cancelled) 

6. (Currently Amended) The method of claim 23, wherein the step of 
generating the text request comprises applying a case-logic technique to 
the speech information. 

7. (Cancelled) 

8. (Cancelled) 

9. (Original) The method of claim 1 , wherein the step of generating the 
second set of information suitable for audio output comprises: 

selecting at least one portion of the first set of information that is 
suitable for audio output; and 
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generating the second set of information based on selecting the at 
least one portion of the first set of information. 

1 0. (Original) The method of claim 1 , wherein the step of generating the 
second set of information suitable for audio output comprises: 

generating text data suitable for audio output based on the first set 
of information and the tagged document, and 

generating audio data based on the text data. 

1 1 . (Original) The method of claim 1 0, wherein the step of generating the text 
data suitable for audio output comprises generating at least one response 
and the step of generating the audio data based on the text data 
comprises applying a text-to-speech (TTS) technique to the at least one 
response. 

12. (Original) The method of claim 1 , wherein the step of accessing the 
tagged document is performed based on the request for the first set of 
information and approximately concurrently with the step of receiving the 
first set of information. 

1 3. (Original) The method of claim 1 , wherein each of the first set of 
information, the tagged document, and the second set of information is at 
least one of a Hypertext Markup Language (HTML) page, an Extensible 
Markup Language (XML) page, a Virtual Reality Modeling Language 
(VRML) page, and a Standard Generic Markup Language (SGML) page. 



14. 



(Currently Amended) A system for providing information suitable for audio 
output, the system comprising: 
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a document database configured for storing a plurality of tagged 
documents; and 

a server comprising an executable resource, wherein the 
executable resource receives a first set of information over a network 
based on a request for the first set of information , the executable resource 
further operable to generate a text request for the first set of information 
based on an acoustic speech recognition (ASR) technique applied to the 
speech information, and submits the text request over the network, 
generating the text request further including: 

receiving speech information specifying the first set of 

information Interpreting at least one primitive construct based on 

the speech information: and 

generating at least one additional primitive construct based 
on a reguest for a user-defined command : 

accesses a tagged document from the document database based 
on receiving the first set of information by 

determining an identity of the request for the first set of 
information; and 

accessing the tagged document based on the identity of the 
request, wherein the identity of the request is based on at least one 
of an identifier for an originator of the request and an identifier for a 
destination of the request; and 

generates the second set of information suitable for audio output 
based on the first set of information and the tagged document. 

1 5. (Original) The system of claim 14, wherein the first set of information is a 
web page based on a Uniform Resource Locator (URL) request for the 
web page; the tagged document is an Extensible Markup Language (XML) 



U.S. Application No.: 09/592.302 Attorney Docket No.: CIS00-2410 

-6- 

document; and the second set of information is filtered web content 
suitable for audio output based on the web page and the XML document. 

1 6. (Original) The system of claim 14, wherein the executable resource 
receives speech information specifying the first set of information, 
generates a text request for the first set of information based on an 
acoustic speech recognition (ASR) technique applied to the speech 
information, and submits the text request over the network. 

17. (Cancelled). 

18. (Cancelled). 

1 9. (Currently Amended) The system of claim 1448, wherein the executable 
resource applies a case-logic technique to the speech information to 
generate the text request. 

20. (Cancelled) 

21. (Cancelled) 

22. (Original) The system of claim 14, wherein the executable resource 
selects at least one portion of the first set of information that is suitable for 
audio output, and generates the second set of information based on 
selecting the at least one portion of the first set of information. 
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23. (Original) The system of claim 14, wherein the executable resource 
generates text data suitable for audio output based on the first set of 
information and the tagged document, and the executable resource 
generates audio data based on the text data. 



24. (Original) The system of claim 23, wherein the text data comprises at least 
one response, and the executable resource applies a text-to-speech (TTS) 
technique to the at least one response to generate the audio data. 



25. (Original) The system of claim 14, wherein the executable resource, in an 
approximately concurrent time frame: 

accesses the tagged document based on the request for the first 
set of information, 

and receives the first set of information. 

26. (Original) The system of claim 14, wherein each of the first set of 
information, the tagged document, and the second set of information is at 
least one of a Hypertext Markup Language (HTML) page, an Extensible 
Markup Language (XML) page, a Virtual Reality Modeling Language 
(VRML) page, and a Standard Generic Markup Language (SGML) page. 



27. (Currently Amended) A computer program product that includes a 
computer readable medium having instructions stored thereon for 
providing information suitable for audio output, such that the instructions, 
when carried out by a computer, cause the computer to perform the steps 
of: 

receiving a first set of information over a network based on a 
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request for the first set of information , receiving the first set of information 
further comprising: 

receiving speech information specifying the first set of 
information; 

generating a text reguest for the first set of information 
based on an acoustic speech recognition (ASR) technioue applied 
to the speech information, generating including interpreting at least 
one primitive construct based on the speech information and 
generating at least one additional primitive construct based on a 
request for a user-defined command, and 

submitting the text reguest over the network : 
accessing a tagged document in response to receiving the first set 
of information, accessing the tagged document further including: 

determining an identity of the request for the first set of 
information; and 

accessing the tagged document based on the identity of the 
request, wherein the identity of the request is based on at least one 
of an identifier for an originator of the request and an identifier for a 
destination of the request; and 

generating a second set of information suitable for audio output 
based on the first set of information and the tagged document. 

28. (Original) The computer program product of claim 27, wherein: 

the step of receiving the first set of information comprises receiving 
a web page based on a Uniform Resource Locator (URL) request for the 
web page; 

the step of accessing the tagged document comprises accessing 
an Extensible Markup Language (XML) document; and 

the step of generating the second set of information comprises 
generating filtering web content suitable for audio output based on the 
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web page and the XML document. 
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29. (Currently Amended) A computer program propagated signal product 
embodied in a propagated medium, having instructions for providing 
information suitable for audio output, such that the instructions, when 
carried out by a computer, cause the computer to perform the steps of: 

receiving a first set of information over a network based on a 
request for the first set of information , receiving the first set of information 
further comprising: 

receiving speech information specifying the first set of 
information; 

generating a text reguest for the first set of information 
based on an acoustic speech recognition (ASR) technioue applied 
to the speech information, generating including interpreting at least 
one primitive construct based on the speech information and 
generating at least one additional primitive construct based on a 
re quest for a user-defined command, and 

submitting the text reguest over the network : 
accessing a tagged document in response to receiving the first set 
of information, accessing the tagged document further including: 

determining an identity of the request for the first set of 
information; and 

accessing the tagged document based on the identity of the 
request, wherein the identity of the request is based on at least one 
of an identifier for an originator of the request and an identifier for a 
destination of the request; and 

generating a second set of information suitable for audio output 
based on the first set of information and the tagged document. 



U.S. Application No.: 09/592.302 Attorney Docket No.: CIS00-2410 

-10- 

30. (Original) The computer program propagated signal product of claim 29, 
wherein: 

the step of receiving the first set of information comprises receiving 
a web page based on a Uniform Resource Locator (URL) request for the 
web page; 

the step of accessing the tagged document comprises accessing 
an Extensible Markup Language (XML) document; and 

the step of generating the second set of information comprises 
generating filtered web content suitable for audio output based on the 
web page and the XML document. 

31 . (Currently Amended) A system for providing information suitable for audio 
output, the system comprising: 

a document database configured for storing a plurality of tagged 
document pages; 

means for producing a second set of information suitable for audio 
output, wherein the producing means receives a first set of information 
over a network based on a request for the first set of information , receiving 
the first set of information further comprising: 

receiving speech information specifying the first set of 

information: 

generating a text reguest for the first set of information 
based on an acoustic speech recognition (ASR) technique applied 
to the speech information, generating including interpreting at least 
one primitive construct based on the speech information and 
generating at least one additional primitive construct based on a 
request for a user-defined command, and 

submitting the text reguest over the network : 
accesses a tagged document from the document database based 
on receiving the first set of information by: 
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determining an identity of the request for the first set of 
information; and 

accessing the tagged document based on the identity of the 
request, wherein the identity of the request is based on at least one 
of an identifier for an originator of the request and an identifier for a 
destination of the request; and 

generatinges the second set of information suitable for audio output 
based on the first set of information and the tagged document. 

32. (Original) The system of claim 31 , wherein the first set of information is a 
web page based on a Uniform Resource Locator (URL) request for the 
web page; the tagged document is an Extensible Markup Language (XML) 
document; and the second set of information is filtered web content 
suitable for audio output based on the web page and the XML document. 

33. (Currently Amended) A method for navigating a web by voice in a server 
configured for executing voice web applications, the method comprising: 

requesting a web page based on a voice web navigation request^ 
requesting the web page further comprising: 

receiving speech information specifying the first set of 
information; 

generating a text reguest for the first set of information 
based on an acoustic speech recognition (ASR) technigue applied 
to the speech information, generating including interpreting at least 
one primitive construct based on the speech information and 
generating at least one additional primitive construct based on a 
request for a user-defined command, and 

submitting the text reguest over the network ; 
receiving a retrieved web page based on the voice web navigation 
request; 
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accessing an extensible markup language page in response to 
receiving the retrieved web page, accessing the tagged document further 
including: 

determining an identity of the request for the first set of 
information; and 

accessing the tagged document based on the identity of the 
request, wherein the identity of the request is based on at least one 
of an identifier for an originator of the request and an identifier for a 
destination of the request; 

generating filtered web content suitable for audio output based on 
the retrieved web page and the extensible markup language page; and 

generating the at least one audio output file based on the filtered 
web content . 

34. (Original) The method of claim 33, wherein the step of requesting the web 
page based on the voice web navigation request comprises the steps of: 

receiving speech information specifying the web page; 

generating a text request for the web page based on an acoustic 
speech recognition (ASR) technique applied to the speech information, 
and 

submitting the text request over the network. 

35. (Original) The method of claim 33, wherein the step of accessing the 
extensible markup language document in response to receiving the 
retrieved web page comprises: 

determining an identity of the voice web navigation request for the 
web page, and 

accessing the extensible markup language page based on the 
identity of the voice web navigation request. 
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36. (Original) The method of claim 35, wherein the identity of the request is 
based on at least one of an identifier for an originator of the voice web 
navigation request and an identifier for a destination of the voice web 
navigation request. 

37. (Original) The method of claim 33, wherein the step of generating the 
filtered web content suitable for audio output comprises: 

generating text data suitable for audio output based on the 
retrieved web page and the extensible markup language document, and 
generating audio data based on the text data. 

38. (Previously Added) The method of claim 1 wherein the method of 
accessing a tagged document comprises accessing a plurality of tagged 
documents, the plurality of tagged documents to define user interface 
logistics and to operate the server; and, 

wherein the method of generating a second set of information 
comprises generating a second set of information suitable for audio input 
based on the first set of information and the plurality of tagged documents. 

39. (Previously Added) The method of claim 38 wherein the plurality of 
tagged documents includes at least one menu document, at least one 
activity document, at least one decision document and at least one 
application state document. 

40. (Previously Added) The method of claim 38 wherein the plurality of 
tagged documents includes at least one filtering documents to be applied 
to the first set of information to generate the second set of information 
suitable for audio output. 
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41 . (Previously Added) The method of claim 1 wherein the step of generating 
the second set of information further comprises the step of executing voice 
application operations from the tagged document to generate the 
information suitable for audio output. 



